Multimodal Input for Perceptual User Interfaces

Authors

  • Joseph J. LaViola
  • Sarah Buchanan
  • Corey Pittman
Abstract

Ever since Bolt’s seminal paper, “Put that there: Voice and Gesture at the Graphics Interface”, the notion that multiple modes of input could be used to interact with computer applications has been an active area of human-computer interaction research (Bolt 1980). This combination of different forms of input (e.g., speech, gesture, touch, eye gaze) is known as multimodal interaction, and its goal is to support natural user experiences by giving users choices in how they interact with a computer. These choices can help to simplify the interface, provide more robust input when recognition technology is used, and support more realistic interaction scenarios because the interface can be more finely tuned to the human communication system. More formally, multimodal interfaces process two or more input modes in a coordinated manner; they aim to recognize natural forms of human language and behavior and typically incorporate more than one recognition-based technology (Oviatt 2003). With the advent of more powerful perceptual computing technologies, multimodal interfaces that can passively sense what the user is doing are becoming more prominent. These interfaces, also called perceptual user interfaces (Turk and Robertson 2000), provide mechanisms that support unobtrusive interaction where sensors are placed in the physical environment and not on the user. The prior chapters in this book have focused on various input technologies and associated interaction modalities. In this chapter, we will examine how these different technologies and their input modalities, specifically speech, gesture, touch, eye gaze, facial expressions, and brain input, can be combined and the types of interactions they afford. We will also examine the strategies for combining these input modes, otherwise known as multimodal integration or fusion. Finally, we will examine some usability issues with multimodal interfaces and methods for handling them.
Research in multimodal interfaces spans many fields including psychology, cognitive science, software engineering,

Similar resources

The Effect of Perceptual Structure on Multimodal Speech Recognition Interfaces

A framework of complementary behavior has been proposed which maintains that direct manipulation and speech interfaces have reciprocal strengths and weaknesses. This suggests that user interface performance and acceptance may increase by adopting a multimodal approach that combines speech and direct manipulation. This effort examined the hypothesis that the speed, accuracy, and acceptance of mu...

Perceptual Interfaces

In recent years, perceptual interfaces have emerged as an increasingly important research direction. The general focus of this area is to integrate multiple perceptual modalities (such as computer vision, speech and sound processing, and haptic I/O) into the user interface. Broadly defined, perceptual interfaces are highly interactive, multimodal interfaces that enable rich, natural, and effici...

Multimodal Interfaces - A Generic Design Approach

Integrating new input-output modalities, such as speech, gaze, gestures, haptics, etc., in user interfaces is currently considered as a significant potential contribution to implementing the concept of Universal Access (UA) in the Information Society; see (Oviatt, 2003), for instance. UA in this context means providing everybody, including handicapped users, with easy human-computer interaction...

Human-centric framework for perceptually adaptive interfaces

Multimodal interfaces have long held the promise of enhanced and effective human machine interaction. The ultimate goal of multimodal interfaces is to facilitate human activity allowing seamless exchange of information. This goal requires a coordinated development effort that incorporates a thorough understanding of human perceptual system in the design of interfaces. In this manner, multimodal...

Coordination and Fusion in Multimodal Interaction

Multimedia Input Analysis: Whereas traditional interfaces support sequential and unambiguous input from devices such as keyboard and conventional pointing devices (e.g., mouse, trackpad), intelligent multimodal interfaces (see www.mitre.org/resources/centers/it/maybury/tutorial.html) relax these constraints and typically incorporate a broader range of input devices (e.g., spoken language, eye a...


Publication date: 2013